NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Introduction to the Minitrack on Interactive Visual Analytics for Knowledge Integration and Decision Intelligence

https://doi.org/10.24251/HICSS.2023.149

Fisher, Brian; Gaither, Kelly; Ebert, David (January 2023, Proceedings of the 56th Hawaii International Conference on System Sciences)

Full Text Available
Optimal mode of delivery in pregnancy: Individualized predictions using national vital statistics data

https://doi.org/10.1371/journal.pdig.0000166

Schulz, Karl W.; Gaither, Kelly; Zigler, Corwin; Urban, Tomislav; Drake, Justin; Bukowski, Radek (December 2022, PLOS Digital Health)
Rappoport, Nadav (Ed.)
Child birth via Cesarean section accounts for approximately 32% of all births each year in the United States. A variety of risk factors and complications can lead caregivers and patients to plan for a Cesarean delivery in advance before onset of labor. However, a non-trivial subset of Cesarean sections (∼25%) are unplanned and occur after an initial trial of labor is attempted. Unfortunately, patients who deliver via unplanned Cesarean sections have increased maternal morbidity and mortality rates and higher rates of neonatal intensive care admissions. In an effort to develop models aimed at improving health outcomes in labor and delivery, this work seeks to explore the use of national vital statistics data to quantify the likelihood of an unplanned Cesarean section based on 22 maternal characteristics. Machine learning techniques are used to ascertain influential features, train and evaluate models, and assess accuracy against available test data. Based on cross-validation results from a large training cohort ( n = 6,530,467 births), the gradient-boosted tree algorithm was identified as the best performer and was evaluated on a large test cohort ( n = 10,613,877 births) for two prediction scenarios. Area under the receiver operating characteristic curves of 0.77 or higher and recall scores of 0.78 or higher were obtained and the resulting models are well calibrated. Combined with feature importance analysis to explain why certain maternal characteristics lead to a specific prediction in individual patients, the developed analysis pipeline provides additional quantitative information to aid in the decision process on whether to plan for a Cesarean section in advance, a substantially safer option among women at a high risk of unplanned Cesarean delivery during labor.
more » « less
Full Text Available
Collecting and analyzing smartphone sensor data for health

https://doi.org/10.1145/3437359.3465599

Drake, Justin; Schulz, Karl; Bukowski, Radek; Gaither, Kelly (July 2021, PEARC '21: Practice and Experience in Advanced Research Computing)
null (Ed.)
Evidence suggests that signatures of health and disease, or digital biomarkers, exist within the heterogeneous, temporally-dense data gathered from smartphone sensors and wearable devices that can be leveraged for medical applications. Modern smartphones contain a collection of energy-efficient sensors capable of capturing the device’s movement, orientation, and location as well characteristics of its external environment (e.g. ambient temperature, sound, pressure). When paired with peripheral wearable devices like smart watches, smartphones can also facilitate the collection/aggregation of important vital signs like heart rate and oxygen saturation. Here we discuss our recent experiences with deploying an open-source, cloud-native framework to monitor and collect smartphone sensor data from a cohort of pregnant women over a period of one year. We highlight two open-source integrations into the pipeline we found particularly useful: 1) a dashboard–built with Grafana and backed by Graphite–to monitor and manage production server loads and data collection metrics across the study cohort and 2) a back-end storage solution with InfluxDB, a multi-tenant time series database and data exploration ecosystem, to support biomarker discovery efforts of a multidisciplinary research team.
more » « less
Full Text Available
The Role Cultural Competency Plays in Teaching Data Science

https://doi.org/10.1145/3219104.3219160

Gaither, Kelly; Gomez, Rosalia; Turner, Helen; DeStefano, Lizanne; Rivera, Lorna; Bland, Marques (July 2018, PEARC '18 Proceedings of the Practice and Experience on Advanced Research Computing)

Supporting Pacific Indigenous Computing Excellence (SPICE) is based on unique expertise and proven models established through a partnership between the Texas Advanced Computing Center at the University of Texas at Austin, Chaminade University of Honolulu and Georgia Institute of Technology (Georgia Tech). The SPICE program leverages shared partnership experiences to address two goals: 1) Perform original research and program development to bridge computation and culture -- developing culturally-consistent conceptual and practical frameworks for thinking about big data problems and communicating student outcomes and attainment to family, community and kupuna (Hawaiian wisdom figures); and 2) Implement an in situ Data Science, Analytics and Visualization (DSAV) Summer Immersion Experience (SIE) as a summer program in Hawai‘i to provide a month-long summer immersion program in data science, visualization, and virtual reality to Native Hawaiian and Pacific Islander (NHPI) and disadvantaged students. In this paper, we present the framework for this effort, with relevant educational, and cultural research to justify decisions made to date.
more » « less
Full Text Available
Stampede 2: The Evolution of an XSEDE Supercomputer

https://doi.org/10.1145/3093338.3093385

Stanzione, Dan; Barth, Bill; Gaffney, Niall; Gaither, Kelly; Hempel, Chris; Minyard, Tommy; Mehringer, S.; Wernert, Eric; Tufo, H.; Panda, D.; et al (July 2017, Proceedings of the Practice and Experience in Advanced Research Computing 2017 on Sustainability, Success and Impact (PEARC17).)

Full Text Available
Evaluation of individual and ensemble probabilistic forecasts of COVID-19 mortality in the United States

https://doi.org/10.1073/pnas.2113561119

Cramer, Estee Y.; Ray, Evan L.; Lopez, Velma K.; Bracher, Johannes; Brennen, Andrea; Castro Rivadeneira, Alvaro J.; Gerding, Aaron; Gneiting, Tilmann; House, Katie H.; Huang, Yuxin; et al (April 2022, Proceedings of the National Academy of Sciences)

Short-term probabilistic forecasts of the trajectory of the COVID-19 pandemic in the United States have served as a visible and important communication channel between the scientific modeling community and both the general public and decision-makers. Forecasting models provide specific, quantitative, and evaluable predictions that inform short-term decisions such as healthcare staffing needs, school closures, and allocation of medical supplies. Starting in April 2020, the US COVID-19 Forecast Hub ( https://covid19forecasthub.org/ ) collected, disseminated, and synthesized tens of millions of specific predictions from more than 90 different academic, industry, and independent research groups. A multimodel ensemble forecast that combined predictions from dozens of groups every week provided the most consistently accurate probabilistic forecasts of incident deaths due to COVID-19 at the state and national level from April 2020 through October 2021. The performance of 27 individual models that submitted complete forecasts of COVID-19 deaths consistently throughout this year showed high variability in forecast skill across time, geospatial units, and forecast horizons. Two-thirds of the models evaluated showed better accuracy than a naïve baseline model. Forecast accuracy degraded as models made predictions further into the future, with probabilistic error at a 20-wk horizon three to five times larger than when predicting at a 1-wk horizon. This project underscores the role that collaboration and active coordination between governmental public-health agencies, academic modeling teams, and industry partners can play in developing modern modeling capabilities to support local, state, and federal response to outbreaks.
more » « less
Full Text Available
The United States COVID-19 Forecast Hub dataset

https://doi.org/10.1038/s41597-022-01517-w

Cramer, Estee Y.; Huang, Yuxin; Wang, Yijin; Ray, Evan L.; Cornell, Matthew; Bracher, Johannes; Brennen, Andrea; Rivadeneira, Alvaro J.; Gerding, Aaron; House, Katie; et al (December 2022, Scientific Data)

Abstract Academic researchers, government agencies, industry groups, and individuals have produced forecasts at an unprecedented scale during the COVID-19 pandemic. To leverage these forecasts, the United States Centers for Disease Control and Prevention (CDC) partnered with an academic research lab at the University of Massachusetts Amherst to create the US COVID-19 Forecast Hub. Launched in April 2020, the Forecast Hub is a dataset with point and probabilistic forecasts of incident cases, incident hospitalizations, incident deaths, and cumulative deaths due to COVID-19 at county, state, and national, levels in the United States. Included forecasts represent a variety of modeling approaches, data sources, and assumptions regarding the spread of COVID-19. The goal of this dataset is to establish a standardized and comparable set of short-term forecasts from modeling teams. These data can be used to develop ensemble models, communicate forecasts to the public, create visualizations, compare models, and inform policies regarding COVID-19 mitigation. These open-source data are available via download from GitHub, through an online API, and through R packages.
more » « less
Full Text Available

Search for: All records